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ABSTRACT: 

Method and apparatus for automatic recognition of handwritten text based on a suitable 
representation of handwriting in one or several feature vector spaces(s), Gaussian modeling in 
each space, and mixture decoding to take into account the contribution of all relevant prototypes 
in all spaces. The feature vector space(s) is selected to encompass both a local and a global 
description of each appropriate point on a pen trajectory. Windowing is performed to capture 
broad trends in the handwriting, after which a linear transformation is applied to suitably eliminate 
redundancy. The resulting feature vector space(s) is called chirographic space(s). Gaussian 
modeling is performed to isolate adequate chirographic prototype distributions in each space, 
and the mixture coefficients weighting these distributions are trained using a maximum likelihood 
framework. Decoding can be performed simply and effectively by accumulating the contribution 
of all relevant prototype distributions. Post-processing using a language model may be included. 
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© A statistical mixture approach to automatic handwriting recognition. 

© Method and apparatus for automatic recognition of handwritten text based on a suitable representation of 
handwriting in one or several feature vector spaces(s), Gaussian modeling in each space, and mixture decoding 
to take into account the contribution of all relevant prototypes in all spaces. The feature vector space(s) is 
selected to encompass both a local and a global description of each appropriate point on a pen trajectory. 
Windowing is performed to capture broad trends in the handwriting, after which a linear transformation is applied 
to suitably eliminate redundancy. The resulting feature vector space(s) is called chirographic space(s). Gaussian 
modeling is performed to isolate adequate chirographic prototype distributions in each space, and the mixture 
coefficients weighting these distributions are trained using a maximum likelihood framework. Decoding can be 
performed simply and effectively by accumulating the contribution of all relevant prototype distributions. Post- 
processing using a language model may be included. 
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The invention is in the field of signal processing, and specifically is directed to handwriting recognition. 
In particular, the invention is directed to handwriting recognition based on a suitable representation of the 
handwriting in one (or several) feature vector spac (s), Euclidean and Gaussian modeling in each spac , 
and mixture decoding to take into account the contribution of all relevant prototypes in all spaces. 
5 Automatic systems purporting to recognize cursive script writing, or even handwritten characters, have 
so far met with only limited success. The reason for that can be traced largely to the lack of robustness 
exhibited by the templates used in the modeling of handwriting. For example, reference is made to U.S. Pat 
No. 4,731,857 which describes a three -step procedure for the recognition of run-on handwritten char- 
acters. First, potential segmentation points are derived. Second, all combinations of the segments that could 
70 reasonably be a character are sent to a character recognizer to obtain ranked choices and corresponding 
scores. Third, the character sequences are combined so that the best candidate word wins. The recognition 
algorithm itself is a template matching algorithm based on dynamic programming. Each template is a fully 
formed character presumably representative of the writer's average way of forming this character, and the 
elastic matching scores of the current character are computed for each template. This strategy is vulnerable 
75 to the extensive variability that can be observed both across writers and across time. 

It is therefore an object of the invention to provide an improved handwriting recognition system. The 
object of the invention is solved basically by applying the features laid down in the independent claims. 

The invention particularly provides an improved handwriting recognition system in which input hand- 
writing is signal processed to extract both local and global feature elements and particularly an improved 
20 handwriting recognition system in which windowing is utilized to detect broad feature events as well as an 
improved handwriting recognition system in which rotation/projection is utilized to eliminate redundancy and 
especially an improved handwriting recognition system in which handwriting is recognized^ based on 
Gaussian modeling in a suitable feature vector space, called a chirographic space. 

Further the invention especially provides an improved handwriting recognition system in which mixture 
25 decoding is utilized to take into account the contribution of all relative prototypes in a chirographic space 
and particularly an improved handwriting recognition system in which multiple codebooks, corresponding to 
multiple chirographic spaces, are utilized to provide a more robust recognition. 

The present invention is directed to devising a fast algorithm for handwriting recognition (i.e., of 
complexity similar to that of elastic matching as disclosed in U.S. Patent No. 4,731,857 to Tappert) with an 
30 acceptable degree of robustness. 

This particularly entails at least three crucial specifications: (i) the feature elements should be chosen 
such as to characterize handwriting produced in a discrete, run-on, cursive, or unconstrained mode equally 
well; (ii) these feature elements should be suitably processed so as to minimize redundancy and thereby 
maximize the information represented on a per - parameter basis; and (Hi) the resulting feature parameters 
35 should be further analyzed to detect broad trends in the handwriting and enable appropriate modeling of 
these trends. These specifications are not met by the current elastic matching approach, since (i) it is 
character -based, and (ii) it simply averages several instances of a character to obtain a character template. 

According to the present invention, the signal processing front -end is a great deal more sophisticated 
than that of elastic matching. Rather than merely chopping the input data into segments, the signal is 
40 transformed onto a higher dimensional feature space (chirographic space), whose points represent all raw 
observations after non - redundant feature extraction. Using a Gaussian (as opposed to Euclidean) measure 
for a more refined clustering, the prototypes in this space are formed for robustness purposes. Hence, each 
prototype represents a small building block which may be common to many characters. Instead of 
character sequences, building block sequences are combined, each of which is assigned a true likelihood 
45 defined on a bona fide probability space (as opposed to just a distance score). Finally, the recognition 
algorithm itself is a maximum a posteriori decoder operating on this probability space. This alternative 
strategy is better suited to meet specifications for robustness. 

Automatic recognition of handwritten text is based on Euclidean and Gaussian modeling in a suitable 
feature vector space, and mixture decoding taking into account the contribution of all relevant prototypes in 
so this space. 

To obtain feature elements adequate for all modes of writing, the difference in coordinates from one 
point to the next is selected, as well as slope and curvature information, which are all translation - invariant. 
This provides adequate information about the local behavior of the pen trajectory. At the same time, it is 
important to collect information about the global behavior of the pen trajectory. Becaus in the case of 
55 cursive or unconstrained handwriting this information may be more difficult to estimate, the corresponding 
global feature elements should be handled separately from the local feature elements. In both cases, 
however, to minimize redundancy, the total covariance matrix of the resulting data is computed, the 
associated eigenvalues and eigenvectors are found, a linear transformation is applied along the directions of 
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these eigenvectors, and the data is projected onto a sub - space of lower dimension. Finally, to detect the 
important building blocks in the handwriting, the reduced feature vectors are clustered into a comparatively 
small number of so - called chirographic prototypes, leading to a chirographic label alphabet representative 
of the elementary building blocks used in the writing process. 

The above strategy can be implemented either on a writer - independent or on a writer - dependent 
basis. In the latter case, a set of chirographic prototypes is derived for each writer. This set is expected to 
capture the idiosyncrasies of the corresponding writer. 

To initialize the algorithm, the assumption is made that some handwriting has been recorded, so that for 
each stroke a sequence of (x,y) coordinate, corresponding to the current trajectory on the tablet, is 
available. Once the chirographic prototypes have been derived, recognition can be performed by simply 
computing the likelihood of each prototype producing the current feature vector, and mixing each individual 
likelihood together to obtain the final likelihood. In this approach, the mixing coefficients are the probabilities 
that each character in the vocabulary gives rise to a given prototype in the chirographic label alphabet. 

In the following preferred embodiments of the invention are described in more detail with reference to 
the drawing in which 

Fig. 1 provides examples of five different types of handwriting that may be input to the handwriting 

recognition system of the present invention; 
Fig. 2 is a block diagram of a generalized handwriting recognition system emphasizing training and 

decoding paradigms; 

Fig. 3 is a block diagram of a handwriting recognition system according to the present invention; 
Fig. 4 is a detailed block diagram of the front -end parameter extraction block which is shown 
generally in Fig. 3; 

Fig. 5 illustrates a ballistically spaced character which is input to the pre -filtering blocks of Fig. 4; 
Fig. 6 illustrates an equally spaced character which is output from the pre - filtering block of Fig. 4; 
Fig. 7 illustrates how the top 1/4 of the ballistically spaced character of Fig. 5 is transformed to the 

equally spaced character of Fig. 6; 
Fig. 8 is a flow chart detailing how the pre -filtering block of Fig. 4 functions to transform the 

ballistically spaced character of Fig. 5 to the equally spaced character of Fig. 6; 
Fig. 9 illustrates a handwritten character being processed according to the prior art to generate a 

feature vector for a point (P); 
Fig. 10 illustrates a four dimensional handwriting feature vector generated in the prior art for the point 

(P) of Fig. 9 by collecting a plurality of spatial attributes; 
Fig. 11 illustrates a portion of a handwritten character being processed according to the teachings of 

the present invention to generate a first parameter vector for a point (P); 
Fig. 12 illustrates a six dimensional local parameter vector generated for the point (P) of Fig. 11 by 

collecting a plurality of local spatial attributes; 
Fig. 13 illustrates a handwritten character being processed according to the teachings of the present 

invention to generate a second parameter vector for a point (P); 
Fig. 14 illustrates a three dimensional global parameter vector generated for the point (P) of Fig. 13 

by collecting a plurality of global spatial attributes; 
Fig. 15 illustrates how windowing is accomplished on a character by concatenation of individual 

parameter vectors as extracted in Figs. 12 and 14; 
Rg. 16 is a flow chart detailing how the windowing block of Fig. 4 functions to perform the 

concatenation of the parameter vectors illustrated in Rg. 15 and thereby produce spliced 

vectors; 

Rg. 17 is a flow chart detailing how the projection block of Rg. 4 functions to produce a feature 

vector from the spliced vectors obtained in Rg. 16; 
Rg. 18 is a detailed block diagram of the prototype construction block of Rg. 3; 
Rg. 19 is a diagram illustrating K- means clustering; 

Rg. 20 is a flow chart detailing how the Euclidean K- means clustering block of Rg. 18 functions; 
Rg. 21 is a flow chart detailing how the Gaussian K- means clustering block of Rg. 18 functions; 
Rg. 22 is a flow chart detailing how the likelihood estimator block of Fig. 3 functions; 
Rg. 23 is a flow chart detailing how the decoder block of Fig. 3 functions. 

In handwriting recognition, handwritten characters generally fall into five groups depicted in Fig. 1 , the 
groups being depicted in increasing order of recognition complexity. Specifically, these groups include a 
first type of writing (W1) known as box discrete wherein individual characters are formed within predefined 
areas, or boxes, thereby simplifying the task of character segmentation. A second type of writing (W2) is 
known as spaced discrete wherein the user intentionally forms each character such that no character 
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touches another. A third type of writing (W3) is known as run-on discrete wherein the user may form 
characters that touch, or "run-on" to, one another. A fourth type of writing (W4) is cursive writing wherein 
the user normally writes the whole word and subsequently crosses the fs and dots the i's and j*s. Finally, a 
fifth type of writing (W5) is unconstrained writing wherein the user may use a mixture of run-on and 
cursive writing. This last type is the most difficult and presents the most complex segmentation and 
recognition task of the five styles depicted in Fig. 3. 

Referring to Fig. 2 there is illustrated in block diagram form, the operation of a general handwriting 
recognition system. A generalized discussion of Fig. 2 is first provided followed by a detailed description of 
the operation of each of the blocks shown therein. At block 2 there occurs data acquisition of stylus or pen 
stroke information. Acquired strokes are operated on to recognize the handwriting information. During a 
training mode of operation as shown at block 4 the acquired handwriting information is analyzed, in 
reference to a known, training script, to train the underlying models purporting to represent this information. 
During use, the model parameters obtained during training are used by a decoding block 6, together with 
the feature vectors corresponding to the (unknown) handwriting to be recognized. Specifically, the decoding 
block 6 employs the models to generate candidate characters from assemblages of individual feature 
vectors. The decoder selects the most probable character based upon the relevant model parameters. 

Recognized handwriting is thereafter made available for use by block 8. By example, a recognized 
message may be simply converted to an alphanumeric format and displayed upon a display device. The 
recognized message may also be provided to any application that would conventionally receive messages 
from a keyboard such as, by example, a word processing system. 

The following algorithm is implemented in the apparatus comprising the handwriting recognition system 
of the invention. The steps of the algorithm are as follows: 

1. Perform a pre -filtering of the data to normalize for the speed of writing. This is accomplished by 
converting the time - dependent representation captured by the tablet, where the spacing between points 
is ballistic in nature, into a time - independent representation, where all the points are equally spaced. 
Linear -interpolation is performed as necessary to find the resulting equally spaced points, tf desired, a 
cubic spline interpolation can also be performed for a more refined interpolation. 

2. For each point P n of coordinate (x„,y„) in the training data, forms a P- dimensional vector p n of 
feature elements representative of the local pen trajectory around P n . For example, a good choice for P 
is 6, with feature elements given by: (i) the horizontal and vertical incremental changes: 

Ax n = x n -x n . u Ay n = y n -y„_ i; (1) 

(ii) the sine and cosine of the angle of the tangent to the pen trajectory at P„: 
COS e n = AxM{Ax n r + (Ay„r, sin e n = Ay^Ax^ + fAy^; (2) 
and (Hi) the incremental changes in the above two parameters: 
A cos e n = cos e n+1 -cos 6„- u A sin e n = sin 0 n+1 - sin e^. (3) 

Note that the last two parameters provide information about the curvature of the pen trajectory at point 

Pn- 

3. For each point P n of coordinates (x„,y n ) in the training data, form a P - dimensional vector P' n of 
feature elements representative of the global pen trajectory up to P„. For example, a good choice for P 
is 3, with feature elements given by: (i) the height from the baseline y n , (ii) the width from the beginning 
of the stroke x„-x h where x, is the first coordinate of the current stroke, and (iii) the inter -stroke 
distance if the current character is composed of more than one stroke. 

4. For each stroke in the training data, determine a subset of the points P n in that stroke, say Q h with the 
property that the O/s are approximately equally spaced. This set should always include the first and last 
points of each stroke, and the spacing interval should be some reasonable function of the line height. 

5. At each location 0/ obtained in Step 4, construct a Q - dimensional spliced vector by concatenating 
together the H vectors p n preceding Q„ the vector q t corresponding to Q b and the H vectors p n 
following^ Q,. Similarly, construct a Q' - dimensional spliced vector by concatenating together the H 
vectors p' n preceding Q h the vector q % f corresponding to Q h and the H vectors p' n following Q h This is 
realizable provided the following holds: 

Q = P(2H + 1), Q' = F(2H + 1). (4) 
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Good choices are H = H - 20, yielding values Q = 246 and 0' = 123. 

6. Compute the mean vector and covariance matrix of all the Q - dimensional vectors corresponding to 
local handwriting features. Denote these as MP and $P\ respectively. Similarly, compute the mean 

s vector and covariance matrix of all the 0 1 -dimensional vector corresponding to global handwriting 
features. Denote these are MP and SP, respectively. 

7. For n = 1,2 compute E/ n) , the eigenvector matrix of S/ n> , and AP, the diagonal matrix of 
corresponding eigenvalues. Note that these quantities obey the relationship: 

w sP=eP\PePt, (5) 



where T denotes matrix transposition. Thus, the leading eigenvectors in E/ n) correspond to the leading 
eigenvalues in A/ n> . 

8. Using the /?i leading eigenvectors from Step 7, project the Q- dimensional feature vectors of Step 5 
75 onto a space of dimension /?i . Call the resulting vectors tp. A reasonable value for /?i is 6. At this point 

the redundancy present in the Q - dimensional spliced feature vectors has been eliminated by con- 
centrating on the most informative feature elements. We call the space spanned by the vectors TP the 
chirographic space C* 1) . 

9. Similarly, using the R 2 leading eigenvectors from Step 7, project the Q 1 - dimensional feature vectors 
20 of Step 5 onto a space of dimension R 2l with resulting vectors TP. A reasonable value for R 2 is 15. 

Note that R 2 > /?i because there is generally less redundancy present in the (global features) Q- 
dimensional spliced feature vectors than in the (local features) Q - dimensional spliced feature sectors. 
We call the space spanned by the vectors TP the chirographic space CPK 

10. Repeat this step for n = 1,2. Starting with random cluster assignments, perform K- means Euclidean 
25 clustering of the projected vectors T/ n> obtained in Steps 8 and 9, so as to obtain preliminary prototype 

distributions in the corresponding R n - dimensional chirographic space. 

11. Repeat this step for n = 1.2. Starting with the preliminary distributions of Step 10, perform K- means 
Gaussian clustering of the projected vectors TP obtained in Steps 8 and 9, so as to obtain final 
Gaussian prototype distributions in both chirographic spaces. Denote these prototype distributions as 

30 v k in \ and use cluster sizes to estimate the prior probability Pn>* (n> ) of each prototype distribution in the 
respective R n - dimensional chirographic space. 

12. Repeat this step for n = 1,2. Using the Gaussian distributions from Step 11, compute, for all vectors 
TP obtained in Steps 8 and 9, the quantity Pr(T / n V* (n) ). Also estimate the probability of each feature 
vector as: 



35 



40 



45 



k= 1 

assuming the total number of clusters in the respective chirographic space is K n . Good choices are /G = 
K 2 = 400. 

13. Repeat this step for n = 1,2. Using the results of Steps 11 and 12, compute the quantity: 



50 



and note against which character a f each vector TP is aligned in the training data. 

14. Repeat this step for n = 1,2. For each character a f in the vocabulary considered, pool together all 
the TP which have been aligned against it and accumulate the corresponding Pn>*< n) |r / n >). After 
normalization, this provides an estimate of Pr^wja/) the prior probability of each prototype distribution in 
the respective chirographic space given each character a,. This is the end of the training phase. 

15. Repeat Steps 1 -5 and 8-9 on the test data, so as to produce test feature vectors in the same 
respective chirographic spaces as the training data. 
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16. For each frame of data //, represented in the chirographic space C< 1> by t P and in the chirographic 
space 0 2) by TP, use the Gaussian mixture distributions obtained in Step 11 and the prior probabilities 
obtained in Step 14 to form the quantity: 



, i.e., the weighted product of two single Gaussian mixture distributions covering the entire chirographic 
label alphabet. In this expressions, a controls the influence of the second codebook relative to the first. A 
is good value for a is 0.7. It remains to multiply the scores of successive frames to obtain the overall score 
for a tentative sequence of frames, thus completing the decoding process. 

Refer now to Fig. 3, which is a block diagram of a handwriting recognition system according to the 
present invention for implementing the just recited algorithm. A general purpose computer 10, which for 
example may be an IBM 3090/VF or an IBM RS 6000, receives character or stroke information produced by 
20 a user using a stylus 12 to write on an electronic tablet 14. The character or stroke information may be 
displayed on the electronic tablet 14 or another display device (not shown). The computer 10 can be used 
either in training mode 16 or in decoding mode 18. 

This is symbolically represented by a dotted line 20. In either training or decoding mode a*front-end 
parameter extraction block 22 must be applied. In training mode 16 the algorithm includes a prototype 

25 construction block 24 and a language model block 26. In decoding mode 18 the algorithm includes a 
likelihood estimator 28 and a decoder 30. 

The blocks 22 - 30 arc shown as functional program modules, however it is to be appreciated that some 
or all of these functional blocks may be implemented in hardware form instead of software form, and be 
within the claimed teachings of this invention. 

so The front -end parameter extraction block 22, which performs steps 1 to 9 of the handwriting 
recognition algorithm, provides feature vectors to the prototype construction block 24, during training mode, 
or the likelihood estimator block 28, during decoding mode. The prototype construction block 24 performs 
steps 10-14 of the handwriting recognition algorithm to produce (i) chirographic prototypes representing 
suitable portions of characters and (ii) mixture coefficients indicating how to combine them. This information 

35 is used in the decoding mode to determine or recognize, unknown characters. The language model block 
26 provides language model probabilities which can be used to determine what characters are most likely to 
occur in a given context, or what words are most likely to occur in a given context. In some embodiments, 
block 26 may be reduced to a library of probable combinations of characters, phrases or sentences 
occurring in handwriting. This language model is utilized by the decoder 30 during recognition. 

40 During the recognition mode, likelihood estimator 28, which performs step 16 of the handwriting 
recognition algorithm, receives feature vectors from block 22 which have been produced from the unknown 
strokes or characters to be recognized. These feature vectors lie in the same chirographic space(s) as the 
chirographic prototypes from block 24, and can therefore be compared to each of them to evaluate the 
contribution of each of them to each particular feature vector. This information is integrated using the 

45 mixture coefficients produced during training to compute the likelihood that each particular feature vector 
"belongs" to a character in the alphabet. Over ail the feature vectors, this can be used to produce candidate 
characters for recognition to decoder 30. Decoder 30 integrates into the overall score the language model 
probabilities from block 26 corresponding to the maximum score. Recognized handwriting in accord with the 
recited algorithm is then produced at output 32 of decoder 30. The recognized handwriting may then be 

so displayed on the tablet 14, or may be provided to a utilization device 33, which for example may be a 
display device, printer, application program or the like. 

Refer now to Rg. 4 which is a detailed block diagram of front -end parameter extraction block 22 which 
is shown generally in Fig. 3. Each sampled point of handwriting is represented by a point which is defined 
by coordinates x n and y n , which are provided to a pre -filtering block 34 which performs step 1 of the 

>5 handwriting recognition algorithm. These points are bailistically spaced as shown in Rg. 5. That is, the 
spacing of the points is a function of the velocity or speed of writing which the writer used to form the 
current character. For a variety of reasons, writers are seldom consistent in their velocity or writing speed, 
which may introduce high error rates in handwriting recognition. The pre -filtering block 34 normalizes the 
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points of Fig. 5 to provide equally spaced points x m and y m at the output of block 34, and as shown in Fig 
6. Details of the pre -filtering will be described shortly relative to Figs. 7 and 8. The equally spaced points 
x m and y m comprising a character as shown in Fig. 6, ar provided to a parameter xtraction block 36, 
which performs steps 2 and 3 of the handwriting recognition algorithm, for providing the vector v m . Details 
of this parameter extraction are described shortly relative to Figs. 11, 12, 13, and 14. The vector v m is 
provided to a windowing block 38, which performs steps 4 and 5 of the handwriting recognition algorithm, 
for providing a spliced vector S h Details of how the spliced vector St is provided is described shortly 
relative to Figs. 14, 15 and 16. The spliced vector $ f is provided to a projection block 40, which performs 
steps 6-9 of the handwriting recognition algorithm, for producing a feature vector f h This eliminates 
redundancy in the spliced parameter vectors. Details of the function of block 40 are set forth relative to Fig. 
17. Projection block 40 responds to the spliced vector to provide a feature vector T t which is provided to 
the prototype construction block 24 and the likelihood estimator 28, as previously explained with respect to 
Fig. 3. Details of projection block 40 are set forth relative to the flow chart of Fig. 17. 

Details of how the ballisticalry spaced character of Fig. 5 is normalized by pre -filtering block 34 (Fig. 4) 
to produce the equally spaced character of Fig. 6 is now explained relative to Rgs. 7 and 8 which illustrate 
how step 1 of the handwriting recognition algorithm is performed. Fig. 7 is representative of the upper 1/4 
curved portion of Fig. 5. First, the density of points is increased by performing some interpolation between 
the original raw points (denoted by a dot •)• This results in a sequence of points comprising the original raw 
points (•) and the interpolated points (j). Then, filtering is accomplished by a priori determining that equal 
spacing between points is a distance R suitably related to the distance between two pels as manifested on 
the electronic tablet 14 (Fig. 3). In Rg. 7, this results in a sequence of points after filtering denoted by an X 
(at 56). Raw and interpolated points are considered to be at non- equally spaced integer points n, and 
filtered points are considered to be at equally - spaced integer points m. 

With respect to Rg. 8, at block 42 the position at n - 1 at the first (raw) point 48 of the stroke is 
designated at m = 1, considered also the first filtered point. The second point 50 of the stroke at n = 2 is 
the first point to be tested for filtering. At block 44 the (Euclidean) distance between the points m and n is 
determined according to the relationship: 

distance = (x n -x m | 2 + iy n -y m j 2 

At block 46 a determination is made whether or not distance is greater than R. With reference to Fig. 7 
point m = 1 is point 48 and point n = 2 is point 50. It can be seen that distance is less than R in Rg. 7, 
therefore the point is rejected and we proceed to block 52 where n is incremented to 3, point 54. Distance 
is again computed in block 44 and compared with R in block 46. Eventually the distance becomes greater 
than R f so the point 55 is accepted (m is made equal to n in block 58). At block 60 the point (x„,y n ) is 
stored as a filtered point (x m ,y m ) which is the point 56, which is the 12th point. At block 62 n is incremented 
by 1 , and a return is made to block 44 where raw and interpolated points are treated as explained above. 

Rgs. 9 and 10 illustrate how a feature vector is derived in the prior art, in reference to a center of 
gravity (CG) of a stroke. The input stroke, by example a lower-case "a", is captured by the electronic 
tablet. Each feature vector includes elements expressive of an x and a y displacement of a sample point (P) 
on the stroke, relative to the center of gravity, a displacement (b) of P relative to a baseline (BL), and 
angular information (theta) relating to a line (L) tangent to P. The resulting feature vector, for this presently 
preferred embodiment, has four elements (Rg. 10) and thus represents a four dimensional (4-d) vector. 

Rgs. 11, 12, 13 and 14 illustrate how parameter extraction, block 36 of Fig. 4, which performs steps 2 
and 3 of the handwriting recognition algorithm, is derived according to this invention for providing a 
parameter vector vs;. Rg. 11 shows the local parameter extraction, Rg. 12 the local parameter vector, Rg. 
13 is the global parameter extraction, and Rg. 14 the global parameter vector. There are 6 local coordinates 
in the local parameter vector and 3 global coordinates in the global parameter vector, for a total of 9 
coordinates. That is, 5 more coordinates are generated than the 4 coordinates of the prior art as shown in 
Rg. 10. For the local parameter vector, calculations are made relative to a current point 64 relative to 
previous points 66 and 67 and following points 68 and 69. The specific calculation for local parameter 
vectors are shown in Rg. 12. For the global parameter vector, calculations are made relative to a current 
point 64 relative to baseline 65, initial point of the character 66, last point of the first stroke 67, and first 
point of the second stroke 68. The specific calculation for global parameter vector are shown in Rg. 14. 
Without loss of generality, the rest of this section will illustrate the handwriting recognition algorithms for one 
codebook only, i.e., either the local parameter vectors or the global parameter vectors. 

Details of the windowing block 38 of Fig. 4 are now set forth relative to Rgs. 15 and 16 to show how 
feature events are extracted from the data. A small number of approximately equidistant feature points are 
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determined using the same algorithm as in Fig. 8, but with a different value of R, and parameter vectors are 
spliced at those points. The number (2H + 1) of parameter vectors to be spliced at each point is 
determined a priori, which in turn specifies the splicing dimension Q = (2H + 1)P. 

Referring to Fig. 15. feature points are shown by dots, and window centers are shown by an X. Dots are 
referenced as points k, and >Cs are referenced by index i as points k h With respect to Fig. 16, at block 70 i 
and a counter j are each set equal to 1. At block 72, k is set to k,- H. and at block 74 the corresponding v k 
(of dimension P) is gotten. A determination is then made at block 76 whether or not (2H + 1) v k have been 
seen. If so, j is reinitialized to 1 and_[ is incremented by 1 in block 78 and the procedure repeats as just 
explained. If not, v k is appended to V, starting at position fl-1) P + 1. k and j are both incremented by 1 
in block 82 and a return is made to block 74 to get the next , h and the procedure repeats as just explained. 

Referring to Fig. 17 the function of the projection block 40 of Fig. 4, which performs steps 6-9 of the 
handwriting recognition algorithm, is explained in detail. The projection block is utilized to eliminate 
redundancy in the spliced parameter vectors from the windowing block 38. A covariance matrix is computed 
for all spliced vectors in block 71, and the associated eigenvalue and eigenvectors are found through 
principal component analysis, in block 75. Using the R leading eigenvalues and eigenvectors of block 74, 
the spliced vectors are projected in block 77 onto a subspace of smaller dimension called chirographic 
space, resulting in the projected vectors r /. How a covariance matrix is computed is described in "Matrix 
Computations" by J.H. Golub and C.F. Van Loan, John Hopkins, University Press, Baltimore, 1989. This 
references also teaches how to perform a principal component analysis at block 73, and how to project all 
S, at block 77. 

The chirographic space is then partioned as shown in Fig. 18 and 19, which details the prototype 
construction block 24 of Rg. 3, to produce chirographic prototypes. The feature vectors are provided to 
block 79 to perform k- means Euclidean clustering. Details of block 79 are set forth relative to Rgs. 19 and 
20. The results of Euclidean clustering are provided to block 81 to perform k- means Gaussian clustering to 
provide prototype distributions * k . Details of block 81 are set forth relative to Fig. 21. Rgs. 18-21 detail 
how steps 10 and 11 of the handwriting recognition algorithm are performed. The prototype distributions or 
chirographic prototypes arc provided to likelihood estimator 28 (Fig. 3) to produce candidate characters to 
decoder 30 (Fig. 3). How to generally accomplish k- means clustering is described in -Clustering 
Algorithms" by J.A. Hartigan, J. Wiley, 1975. 

Rg. 19 illustrates a space 83 which is divided into clusters 84, 86 and 88. Each cluster includes a 
plurality of vectors indicated as points x, with a centroid ® being computed for each such cluster of vectors. 

Refer to Fig. 20 which details block 79 of Rg. 18. A number of random seeds, chosen to be 250, is 
picked at block 90 from all the points in the chirographic space obtained from block 22 in Rg. 3. The 
Euclidean distance between each point and each seed is calculated at block 92. By assigning each point to 
its closest seed, the space is partioned into clusters at block 94. This corresponds to the clusters 84, 86 
and 88 of Rg. 19. The centroid of each cluster is computed at block 96. This corresponds to the ® in Fig. 
19. These centroids are set to replace the original seeds at block 98. At decision block 100 a determination 
is made if the maximum number of iterations is reached, tf not, a return is made to block 92 and the steps 
are repeated as just described. If so, the calculation of the Euclidean clustering is complete. 

Refer to Rg. 21 which details the Gaussian clustering block 81 of Fig. 18. The Euclidean clusters 
obtained in block 79 (Rg. 18) are provided at block 102. The Gaussian distance between each point and 
each centroid is calculated at block 104. By assigning each point to its closest centroid, the space is 
partioned into clusters at block 106. The new centroid of each cluster is computed at block 108. At decision 
block 110 a determination is made if the maximum number of iterations is complete. If not, a return is made 
to block 104 and the steps are repeated as just described. If so, the calculations of the Gaussian clustering 
is complete. This results in final prototype distributions in chirographic space. 

Refer to Rg. 22, which performs step 16 of the handwriting recognition algorithm for one codebook 
only, which illustrates how information resulting from steps 12 and 13 of the handwriting recognition 
algorithm is operated on by the likelihood estimator 28 of Rg. 3 to produce candidate characters for the 
decoder 30. At block 114 a variable i which is indicative of the current frame (or window center), is 
initialized to 1, and the test feature vector n is provided from the front end parameter extraction 22 (Rg. 4) 
as indicated at block 116. At block 118 a variable k representative of the current prototype distribution is 
initialized to k = 1. The conditional probability of this feature vector given this prototype distribution is 
computed at block 120 and is provided to block 122. 

The prototype construction block 24 (Rg. 3) of the training phase as represented by the chirographic 
prototype distributions n* in block 124 and mixture coefficients P/<n*ja/) in block 126 are also provided to 
block 122 where the combined probability is computed and stored. At decision block 128 a determination is 
made if k has reached the maximum number of clusters. If not, k is incremented by 1 as indicated at block 
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130. and a return is made to block 120 and the just recited process is repeated. If so, the scores just stored 
are accumulated at block 132 for all characters a f in the underlying alphabet. At decision block 134 a 
determination is made if all frames i have been seen for the current character under consideration. If not. i 
is incremented by 1 at block 136 and a return is made to block 116 and the just recited process is 
repeated. If so, the accumulated scores are ordered in block 138 and a candidate list of characters a, is 
formed from the top J scores for provision to the decoder 30 (Fig. 3). 

Refer now to Fig. 23 which is a flow chart representation indicative of the operation of the decoder 30 
(Fig. 3). A variable t, which is indicative of the current character under consideration is initialized to 1 at 
block 142. The candidate list of characters from the likelihood estimator 28 (Rg. 3) for character Q is 
provided at block 144. A variable j indicative of the current candidate character is initialized at 1 at block 
146, and C, is tentatively set equal to a, at block 148. From the training block, Language Model probabilities 
26 (Rg. 3) are provided at block 150. Based on these probabilities and previously recognized characters at 
block 154, the final score of the character a; is computed at block 152. This score represents the likelihood 
that Ct is recognized as a, taking into account the contextual information through the language model. At 
decision block 156 a determination is made if j = J f the index of the last candidate character in the 
candidate list provided by the likelihood estimator 28, If not, a return is made to block 148 and the just 
recited process is repeated. If so, final scores incorporating language model probabilities are ordered in 
block 158. The top candidate is selected as the recognized answer for character Ct in block 160. At decision 
block 162 a determination is made if t = Tmax, the index of the last character in the string to be 
recognized. If not, t is incremented by 1 in block 164, to get the next character to be recognized. An update 
is made at block 166. to insert the recognized Q in block 154. and a return is made to block 144, with the 
just recited process being repeated. If so, the process is complete as indicated at block 168 and the whole 
string of characters has been recognized. 

Claims 

1. In a handwriting recognition system, the combination comprising: 
means for sampling handwriting inputs from at least one writer; 

means for signal processing said inputs to extract both local and global feature elements; 
means for windowing said inputs to detect broad feature events; 
means for rotating and projecting said inputs to eliminate redundancy; 

means for performing clustering in a feature vector space to provide prototype distributions in said 
feature vector space; 

means for performing Gaussian modeling in said feature vector space; 

means for computing mixture coefficients to evaluate the relative contribution of each prototype 
distribution to the current sample handwriting inputs; and 

means for performing mixture decoding in said feature vector space to produce recognized handwriting; 

2. In a handwriting recognition system, the combination comprising: 
means for sampling known handwriting inputs from at least one writer; 

means for providing a parameter vector representation of said handwriting inputs; 
means for providing a spliced vector representation of said handwriting inputs; and 
means for providing a feature vector representation of said handwriting inputs. 

3. The combination claimed in claim 2, including: 

means for performing clustering in a feature vector space to provide prototype distributions in said 
feature vector space; 

means for performing Gaussian modeling in said feature vector space; 

means for computing mixture coefficients to evaluate the relative contribution of each prototype 
distribution to the current sample handwriting inputs. 

4. The combination claimed in claim 2 or 3, including: 

means for sampling an unknown handwriting input from the same or another writer; 
means for providing a feature vector representation of said unknown handwriting input; 
means for estimating the likelihood that the feature vector representation corresponds to each character 
in the underlying alphabet, utilizing said chirographic prototype distributions and said mixture coeffi- 
cients; 

said combination preferably further comprising: 
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means for performing mixture decoding in said feature vector space to produce recognized handwriting; 
means for utilizing multiple feature vector spaces for producing a more robust recognition. 

5. In a handwriting recognition system, the combination comprising: 

an electronic tablet which can be written on with a stylus, and on which writing can be displayed; 
means for sampling known characters comprising handwriting inputs from at least one writer, in 
response to said one writer writing on said tablet with said stylus; 

means for providing a feature vector representation in chirographic space of the sampled known 
characters; 

means for providing chirographic prototype distributions and mixture coefficients in response to the 

provision of the feature vector representation in chirographic space of the sampled known characters; 

means for sampling unknown characters comprising handwriting inputs to be recognized from the same 

or other writers, in response to said writers writing on said tablet with said stylus; 

means for providing a feature vector representation in said chirographic space of the sampled unknown 

characters; 

means for estimating the likelihood that at least one character of the underlying alphabet is a candidate 
character for the recognition of each unknown character, based on a probabilistic comparison of said 
chirographic prototype distributions, appropriately weighted by said mixture coefficients, and said 
parameter vector representation in chirographic space of the sampled unknown characters, including 
means for making a list of candidate characters; and 

means for performing a probabilistic analysis of said list of candidate characters to recognize the 
sampled unknown characters comprising the handwriting inputs to be recognized; 
said combination preferably further comprising: 

means for displaying the recognized characters on said electronic tablet and/or 

means for providing a library of probable combinations of at least characters occurring in handwriting; 

and 

wherein said means for performing a probabilistic analysis includes means for making a probabilistic 
comparison of said list of candidate characters and said library of probable combinations of at least 
characters occurring in handwriting, to recognize the sampled unknown characters comprising the 
handwriting inputs to be recognized. 

6. In a handwriting recognition system, the combination comprising: 

an electronic tablet which can be written on with a stylus, and on which writing can be displayed; 
means for sampling handwriting inputs from at least one writer, in response to said one writer writing on 
said tablet with said stylus; 

means for providing a parameter vector representation of said handwriting inputs; 

means for providing a spliced vector representation of said handwriting inputs; 

means for computing a covariance matrix of said spliced vector representation; 

means for calculating eigenvalues and eigenvectors associated with the computed covariance matrix; 

means for applying a linear transformation along the direction of the leading eigenvectors, this providing 

a feature vector representative of said handwriting inputs; 

said combination preferably further comprising: 

means for performing clustering in a feature vector space to provide prototype distributions in said 
feature vector space; 

means for performing Gaussian modeling in said feature vector space; 

means for computing mixture coefficients to evaluate the relative contribution of each prototype 
distribution to the current sample handwriting inputs and/or 

means for recognizing unknown handwriting in response to a probabilistic comparison of the unknown 
handwriting in chirographic space with said chirographic prototypes, as combined in accordance with 
the mixture coefficients. 

7. In a handwriting recognition system, a method comprising the steps of: 
sampling handwriting inputs from at least one writer; 

signal processing said inputs to extract both local and global feature elements; 
windowing said inputs to detect broad feature events; 
rotating and projecting said inputs to eliminate redundancy; 

performing clustering in a feature vector space to provide prototype distributions in said feature vector 
space; 
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performing Gaussian modeling in said feature vector space; 

computing mixture coefficients to evaluate to relative contribution of each prototype distribution to the 
current sample handwriting inputs; and 

performing mixture decoding in said feature vector space to produce recognized handwriting. 

8. In a handwriting recognition system, a method comprising the steps of: 
sampling known handwriting inputs from at least one writer; 
providing a parameter vector representation of said handwriting inputs; 
providing a spliced vector representation of said handwriting inputs; and 
providing a feature vector representation of said handwriting inputs; 
said method preferably further comprising the steps of: 

performing clustering in a feature vector space to provide prototype distributions in said feature vector 
space; 

performing Gaussian modeling in said feature vector space; 

computing mixture coefficients to evaluate the relative contribution of each prototype distribution to the 
current sample handwriting inputs and/or sampling an unknown handwriting input from the same or 
another writer; 

providing a feature vector representation of said unknown handwriting input; 

estimating the likelihood that the feature vector representation corresponds to each character in the 
underlying alphabet, utilizing said chirographic prototype distributions and said mixture coefficients 
and/or 

performing mixture decoding in said feature vector space to produce recognized handwriting; 
utilizing multiple feature vector spaces for producing a more robust recognition. 

9. In a handwriting recognition system, a method comprising the steps of: 

sampling known characters comprising handwriting inputs from at least one writer, in response to said 
one writer, in response to said one writer writing on an electronic tablet with a stylus; 
providing a parameter vector representation in chirographic space of the sampled know characters; 
providing chirographic prototypes in response to the provision of the parameter vector representation in 
chirographic space of the sampled known characters; 

sampling unknown characters comprising handwriting inputs to be recognized from said at least one < 
writer, in response to said one writer writing on said tablet with said stylus; 

providing a parameter vector representation in said chirographic space of the sampled unknown 
characters; 

estimating the likelihood that at least one of said chirographic prototypes is a candidate character for 
being recognized as an unknown character, based on a probabilistic comparison of said chirographic 
prototypes, and said parameter vector representation in chirographic space of the sampled unknown 
characters, including making a list of candidate characters; and 

performing a probabilistic analysis of said list of candidate characters to recognize the sampled 

unknown characters comprising the handwriting inputs to be recognized; 

said method preferably further comprising the steps of: 

displaying the recognized characters on said electronic tablet and/or 

providing a library of probable combinations of at least characters occurring in handwriting; and 
wherein performing a probabilistic analysis includes making a probabilistic comparison of said list of 
candidate characters and said library of probable combinations of at least characters occurring in 
handwriting to recognize the sampled unknown characters comprising the handwriting inputs to be 
recognized. 

10. In a handwriting recognition system, a method comprising the steps of: 

sampling handwriting inputs from at least one writer, in response to said one writer writing on an 
electronic tablet with a stylus; 

computing a covariance matrix of the sampled handwriting; 

calculating eigenvectors associated with the computed covariance matrix; 

applying a linear transformation along the direction of the eigenvector; and 

producing chirographic prototypes of the sampled handwriting in response to the application of the 
linear transformation; 

said method preferably further comprising the steps of: 

providing mixture coefficients indicative of how to combine the chirographic prototypes for handwriting 
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recognition and/or 

recognizing unknown handwriting in response to comparing the unknown handwriting to the chirog 
raphic prototypes as combined in accordance with the mixture coefficients. 
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